首页> 外文OA文献 >A Conditional Random Field for Discriminatively-trained Finite-state String Edit Distance
【2h】

A Conditional Random Field for Discriminatively-trained Finite-state String Edit Distance

机译:判别训练有限状态的条件随机场   字符串编辑距离

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The need to measure sequence similarity arises in information extraction,object identity, data mining, biological sequence analysis, and other domains.This paper presents discriminative string-edit CRFs, a finitestate conditionalrandom field model for edit sequences between strings. Conditional randomfields have advantages over generative approaches to this problem, such as pairHMMs or the work of Ristad and Yianilos, because as conditionally-trainedmethods, they enable the use of complex, arbitrary actions and features of theinput strings. As in generative models, the training data does not have tospecify the edit sequences between the given string pairs. Unlike generativemodels, however, our model is trained on both positive and negative instancesof string pairs. We present positive experimental results on several data sets.
机译:在信息提取,对象识别,数据挖掘,生物序列分析和其他领域中,需要测量序列相似性。本文提出了判别性字符串编辑CRF,这是一种用于字符串之间编辑序列的有限状态条件随机场模型。条件随机场相对于生成问题的方法(例如pairHMM或Ristad和Yianilos的工作)具有优势,因为作为条件训练的方法,它们允许使用输入字符串的复杂,任意动作和特征。与生成模型一样,训练数据不必指定给定字符串对之间的编辑序列。但是,与生成模型不同,我们的模型在字符串对的正例和负例上进行训练。我们在几个数据集上给出了积极的实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号